Weak Convergence Results for Multiple Generations of a 

Branching Process 



We establish limit theorems involving weak convergence of multiple generations of critical 
and supercritical branching processes. These results arise naturally when dealing with the joint 
asymptotic behavior of functionals defined in terms of several generations of such processes. 
Applications of our main result include a functional central limit theorem (CLT) , a Darling-Erdos 
result, and an extremal process result. The limiting process for our functional CLT is an infinite 
dimensional Brownian motion with sample paths in the infinite product space (Co[0, 1])°°, with 
the product topology, or in Banach subspaces of (Cq[0, 1])°° determined by norms related to 
the distribution of the population size of the branching process. As an application of this CLT 
we obtain a central limit theorem for ratios of weighted sums of generations of a branching 
processes, and also to various maximums of these generations. The Darling-Erdos result and 
the application to extremal distributions also include infinite dimensional limit laws. Some 
branching process examples where the CLT fails are also included. 
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1 Introduction 



Our interest in limit theorems for multiple generations of branching processes is motivated by both 
practical and theoretical considerations. The practical side stems from the use of branching processes 
to model certain aspects of scientific experiments. One such problem area is Polymerase Chain 
Reaction (PCR) experiments. In such an experiment, an initial amount of DNA is amplified for 
use in various biological experiments. The PCR experiment evolves in three phases; an exponential 
phase, a linear phase, and a pleateau phase, with branching processes and their variants frequently 
used to model the exponential phase. One of the goals of such experiments is to "quantitate" the 
initial number of DNA molecules in a sample or cquivalently, estimate the number of ancestors in a 
branching process [5] . The statistical estimate of the initial number of ancestors is a function of the 
estimate of the mean of the branching process [5], and in order to make this estimate, data are used 
from the last few cycles (generations) at the end of the exponential phase. Since the cycle(generation) 
corresponding to the end of exponential phase is somewhat arbitrary, it is natural to consider the 
joint distributions of the generations involved to determine whether two different scientists with 
different choices for the end of the exponential phase obtain consistent results. Furthermore, these 
joint distributions can also be used to estimate the end of the exponential phase. 

Theoretical motivation for our results involves the desire to understand analogues of classical 
functional limit theorems for i.i.d. sequences that hold for multiple generations of the stochastic 
processes arising in the branching setting. What we present here deals with weak convergence results. 
Theorem 1 is our main result, and allows a large number of applications, a few of which are presented 
explicitly as Applications 1-3, and Theorems 2 and 3 in Section 2. Application 1 is a functional CLT, 
yielding a Donsker type result, Application 2 a Darling-Erdos result, and Application 3 an extremal 
process result, all obtained under best possible conditions. For example, in the functional CLT we 
use only second moments, and in the Darling-Erdos result we use the moment condition shown in 
[3] to be necessary for this result for i.i.d. sequences. A similar comment applies to the application 
to extremal processes. Here the regularly varying tail condition assumed is precis ely that required 
for the limiting maximal distribution at t = 1 to exist for an i.i.d sequence. Other applications 
are also possible once one has Theorem 1 available, but in Theorems 2 and 3 we turn to some 
applications of our functional CLT. Theorem 2 yields a strengthening of the functional CLT to the 
Banach spaces co.a(Co[0, 1]). Another consequence of Application 1 is a new proof of the CLT for 
the non-parametric maximum likelihood estimate of the mean of a supercritical branching process. 
A previous proof of this in [9] involves a martingale CLT, whereas the proof herein is an elegant 
application of our functional CLT result with t = 1, and the asymptotic independence obtained in 
the coordinates of the limiting process. Moreover, our proof allows us to extend this result to allow 
the application of a broad range of weights on the various generations. In [§] all the weights are 
equal to one. 

In order to describe our results in more detail we begin with a brief description of the branching 
process. Let {£, n .j,j > 1>ti > 1} denote a double array of integer valued i.i.d. random variables 
defined on the probability space (17, T, P), and having probability distribution {pj : j > 0}, i.e. 
P(£i,i = k) = pk- Then {Z n : n > 0} denotes the Galton- Watson process initiated by a single 



2 



ancestor Zq = 1. It is iteratively denned on (ft, J 7 , P) for n > 1 by 




Let to = E{Z\). It is well known that if to > 1 (i.e. the process is supercritical), then Z, 



'n 



00 



with positive probability and that the probability that the process becomes extinct, namely q, is less 
than one. The complement of the set U^ 1 {Z„ = 0} is the so called survival set, and is denoted by 
S. If to > 1, then P(S) = 1 — q and Z n — ► oo a.s. on S. Also, g = if and only if p a = 0. If to < 1, 
then assuming]?! 7^ 1 when to = 1, the process becomes extinct with probability one, i.e. P(S) = 0. 
To avoid degenerate situations we assume throughout the paper that po + Pi < 1- 

The paper is organized as follows: Section 2 develops the basic notation and states the main 
results of the paper. Section 3 contains the proof of Theorem 1, and Sections 4 and 5 that of the 
CLT applications in Theorem's 2 and 3, respectively. Section 6 contains examples providing some 
insight into the CLT for subcritical processes, and also for supercritical processes when one uses 
deterministic normalizations. In this latter example one does not get a Gaussian limit law, but a 
certain mixture of Gaussian laws. This mixture can be anticipated from the Kesten-Stigum result, 
but its precise expression requires some interesting analysis. In particular, these examples show 
precisely why the random normalizations used in our theorems are possibly the " best choice" if one 
wants classical results to persist in limit theorems for multiple generations of these processes. 

Acknowledgment. The authors thank the referee for a very careful reading of the manuscript, 
and for making several important suggestions that led to improvements in the paper. In particular, 
Lemma 3 as presented here is an elegant modification of our original arguments, and is based on the 
referee's comments. 



In this section we state the main result of the paper. This result allows us to obtain a wide variety 
of limit theorems for branching processes based on r(n)-generations, where 1 < r(n) < n. Following 
its statement we present some interesting consequences and applications. In particular, in these 
applications the integer sequence {r(n)} may approach infinity as n goes to infinity. As will be seen, 
they all follow rather immediately from our main result when combined with various classical limit 
theorems for i.i.d. sequences. 

Throughout (M, d) is a complete separable metric space with distance d, and M°° denotes the 
infinite product of copies of M with the product topology, metrized by 



In our applications M is the real line or some function space. If M is the real line, then the distance 
is the usual one, and for our functional CLT application M denotes the set of all continuous functions 
on [0,1] that vanish at 0, which we denote by Co[0, 1]. Of course, then Co[0, 1] is a Banach space in 
the suprcmum norm 



2 Notation and Main Results 




(2.1) 



</(/) = sup \f(t)\ 



(2.2) 



0<t<l 
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and the distance used is d(f,g) = q(f — g), f,g E Cq[0, 1]. Application 3 below contains a different 
choice of M, and others are certainly possible, but these suffice to provide a sampling of possible 
consequences of our main theorem. 

Since we want to study the asymptotic behavior of r(n) generations of the branching process, 
and r(n) may well converge to infinity, it is useful for these purposes to define 

^nj(n) = (-X"n,Z„_i>^n-l,Z„_ 2 ) ' ' ' -^n-r(n)+l,Z„_ r(n) > z -, z -, ' ' ' )> (2-3) 

where z is a fixed element in M, 

X n -j + l.Zn-j = ^Z„_j((fi-i+l,l: - - - ,Cn-i+l,Z„_j), (2-4) 

and the mappings Hk(-), k > 1, take R k into M are Borel measurable. We also define X„_j + i.o = z 
for 1 < j < r(ra) < n. Since in our results we condition on Z„_i > 0, the choice of z is immaterial. 
Hence 'K n ,r(n) is an element of the infinite product space M°° . Moreover, in our applications M 
always contains a zero element which we denote by 0, and if we take the fixed element z S M in 
(2.3) to be this 0, then we have 

Xn,r(n) = (^n,Z„_i, ^-n-X,Z n -i ^n-r(n)+l,Z n - rM > 0, 0, ■ • • ). (2.5) 

We will use => to denote weak convergence of probability measures. Our main theorem for the 
random vectors X„ r( -„) is the following. 

Theorem 1. Let m > 1 > pi, assume 1 < r(n) < n with linin^oo r{n) = oo, and that X n>r ( n j is 
defined as in (2.3)-(2.4)- Also assume that if : j > 1} are i.i.d. non-negative integer valued 
random variable with £(£i) = C{Z\), then the M-valued random elements {Hk : k > 1} used to 
define X„ r („) are such that 

H k (£ 1 ,--,Z k )=>H (2.6) 

on (M, d) . Moreover, assume that 



Then the probability measures 

M« = jC(X„ iT .(„) \Z n -i > 0) (2.8) 
converge weakly on (M 00 ,^), i.e. we have 

C(B 1 ,B 2 ,---), (2.9) 

where the Bi 's are independent copies of H . 

Remark 1. If m > 1, then \im n ^ ^ P(Z n >0) = 1 — q > and hence the condition (2.7) holds. 
If m = 1 and < a 2 = E{(Z\ — m) 2 ) < oo, then (2.7) also follows from Theorem 1 in [1, pl9]. 
Furthermore, as was pointed out by the referee, by Lemma 2 in UOj/ . there are interesting examples 
where m = 1 and a 2 = oo, yet (2.7) holds. 



,. P(Z n -i > 0) 
hm ; — 

rwoo P(Z n > 0) 



(2.7) 
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Next we present three immediate applications of Theorem 1. They include a functional CLT, 
a Darling-Erdos Theorem, and also an extremal process result. It is interesting to observe that 
the limiting distributions of the coordinates of X„ r („) are asymptotically independent, whereas the 
generations of the branching process itself are correlated. 

Application 1: Let m > 1, < a 2 = E((Z\ — m) 2 ) < oo, and assume 1 < r(n) < n with 
lim n _,oo r(n) = oo. Take M = Co[0, 1] with the sup-norm q, define Hk(£i, ■ ■ ■ ,£fc)(0) = 0, and for 
< t < 1 set 

Hk(ti, ■ ■ ■ ,&)(*) = —rf - m) + (tk - LtfcJ)— =(£ LtfcJ+ i - m). (2.10) 

I — l 

Then Donsker's Invariance principle implies (2.6) holds with C(H) the probability measure induced 
on Co[0, l] by a standard Brownian motion starting at zero when t = 0. If X„ r( -„) is defined as in 
(2.3-5) with Hk as in (2.10), and (Co[0, l])°° has the product topology induced when using the norm 
q on Co[0, l], then an immediate consequence of Theorem l is that the probability measures 

fl n =£(X„ ir („)|Z„_ l > 0) (2.ll) 

converge weakly there, i.e. we have 

fi>n => £{Bi,B 2l ■ ■ ■), (2.12) 
where the B^s are independent Brownian motions. 



Application 2: Let m > l, < a 2 = E((Z X - m) 2 ) < oo, lim^oo LLtE(Z 2 I(\Zi\ > t)) = 0, 
where Lt = \og e {t V e) and LLt = L(Lt). In addition, assume l < r(n) < n with limn^oo r{n) = oo, 
and take M — R 1 . Define 

Hkfa, ■•• ,&) = a k max ^=^~ w) - b k , (2.13) 
i<i<fe (7\/] 

where ak = (2LLk)^ and bk — 2LLk + ^LLLk — ^L(4tt). Then the Darling-Erdos Theorem as 
in Theorem 2 of [3] implies (2.6) holds with C(H) the probability measure induced on M by the 
cumulative distribution function 

G(x) = exp{-e~ x }, - oo < x < oo. (2-14) 

If X n r ( n ) is defined as in (2.3-5) with Hk as in (2.13), and M°° — R°° has the product topology , 
then an immediate consequence of Theorem 1 is that the probability measures 

fj, n = £(X„ !r .(„)|Z n _i > 0) (2-15) 

converge weakly there, and we have 

fJ-n => £(Bi,B 2 , ■■■), (2.16) 

where the B^s are independent random variables with cumulative distribution function G{x) given 
by (2.14). 
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Application 3: Let m > 1, assume F(x) = P(Z\ < x) < 1 for all x E R , and that 1 — F(x) 
is regularly varying at oo with exponent —a where a > 1. In addition, assume 1 < r(n) < n with 
lim, woo r(n) — oo, and that (2.7) holds. Then by Theorem 6.3, p. 455 of [I], there exists a,j > such 
that 

lim P(— max{0,£i,--- <x) =exp{-x~ a }, x > 0, (2.17) 

j— >oo 

and zero for x < 0. Now define the extremal process mk(t) which is zero in [0, —) and 

m k (t) = — max{0,a,--- ,&}, - < * < i = 1, - • • , fe - 1, (2.18) 

and 

m fc (t) = — max{0,Ci,--' ,6}, * > 1- (2-19) 

Let M denote the finite, non-decreasing functions z(t)on [0, oo) such that z(0) — and z(t) = z(l) 
for t > l.Then M is a complete separable metric space in the Levy metric dh on M , and if 

fffc(a,-" - ,Cfc)W=TO fe (t), 0<i<oo, (2.20) 

then by Theorem 2.1 and 3.1 of [7] we have (2.6) where {H(t) : < t < oo} is a Markov extremal 
process with sample paths in M. Therefore, if X n w n ) is defined as in (2.3-5) with Hk as in (2.20), and 
M°° has the product topology, then an immediate consequence of Theorem 1 is that the probability 
measures 

» n = jC(X ntr{n) \Z n _ x >0) (2.21) 

converge weakly there, and we have 

ix n ^ C{B U B 2 ,---), (2.22) 
where the B^s are independent copies of the Markov process {H (t) : < t < oo}. 

As is easily seen, Theorem 1 combined with other classical limit theorems for i.i.d sequences 
provides many possible limit theorems for suitable choices of the random elements X„ iT .( n ). However, 
what we turn to next are some applications and extensions of the functional CLT of application one. 
The first involves a functional CLT in Banach subspaces of (Co [0,1])°° determined by weighted 
analogues of the q-norm. That is, let A = {Aj : j > 1} be a sequence of strictly positive numbers, 
and for f = (f u f 2 , ■ ■ ■ ) G (C [0, 1])°° define 

g A (f) =supA,-||/ i ||, (2.23) 

where || • || is the supremum norm on Co[0, 1]. Also, let co,a(Co[0, 1]) be the subspace of (Co[0, 1])°° 
given by 

co.A(C o [0,l]) = {f =(A,/ 2 ,---)e (C [0,1])°° : lim X j \\fj\\=Q}. (2.24) 

j—>oo 

Then q\(t) is a norm making the subspace co,a(Co[0, 1]) a Banach space. 

As before we will use =>■ to denote weak convergence of probability measures. Our functional 
central limit theorem in co,a(Co[0, 1]) is the following. 
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Theorem 2. Let m > 1 and assume 1 < r{n) < n with linin^oo r(n) = oo. Also assume that the 
offspring distribution £(£) = C{Z\) is such that < a 2 = E((£ — m) 2 ) < oo and satisfies one of the 
following conditions: 

P(\£ - m\ > x) < f3e~ ex \ for all x > 0, or (2.25) 



E(\£ - m\ p ) < oo for some p > 2, (2.26) 

and that r(n) = o(n). Let X„ jr („j be defined as in (2.3-5) with • • • as in (2.10). If (2.25) 

holds and we take A = {Xj} where Xj = (Sj log(j + 3)) - ^ and limj^oo Sj = oo, then on the Banach 
space Co, a (Co [0, 1]) the probability measures 

M„ = £(X„ ;r( „ ) |Z„_ 1 >0). (2.27) 

are such that 

p n ^jC(B u B 2 ,---), (2.28) 
where the Bi 's are independent standard Brownian motions. If instead we assume (2.26) and X = 

11 + 5) 

{Xj} where Xj = j » and 8 > 0, then we again have (2.28) on Co,a(Co[0, 1]). 

Remark 2. The condition r(n) = o(n) in Theorem 2 can be weaken somewhat. For example, if 
m = 1 and < a 2 = E((£ — m) 2 ) < oo we need only assume that limsup„^ oc < 1. Similarly, 
if m > 1, we may replace r(n) = o(n) by lim„^ 00 (n — r{n)) = oo. Both of these improvements 
follow from refinements of (4-7)-(4-8) of Lemma 4 below, and were pointed out by the referee. We 
emphasize that throughout the assumption lim^—xx, r(n) = oo is in effect. 

If G is Gaussian random variable with mean zero and variance one, then for all x > 

P{ sup B(t) <x) = P(\G\ < x). 

0<t<l 

Hence Theorem 2 and the continuous mapping theorem applied to the processes {X„ ;T .(„)(-) : n > 1} 
with values in Co,a(Co[0, 1]) immediately imply the following result. 



Corollary 1. If (2.25) or (2.26) holds with corresponding {Xj : j > 1} as indicated, then the 
conditions of Theorem 2 imply that 

lim P{ max Xj ( Zn ~ j+1 ~ mZn ~i> < x \z n _ x > 0) = P(supXjGj < x), 

n^oc l<j<r(n) 7? ?>1 

where G\, G2, • • • are i.i.d. N(0, 1) random variables. In addition, we also have 
lim P( max Xj( 3 , — ■ — ) < x\Z n -i > 0) = P(sup Xj\Gj\ < x). 

n^oo \<j<r(n) 72 j>\ 



Remark 3. If one wants results similar to those of the corollary with Xj = 1, then Theorem 2, or 
application one applies, as long as we restrict the maximum to be over only finitely many j 's, say 

ie{i,2,-,rf}- 
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In Theorem 3 below we obtain a CLT for ratios of weighted sums of a supercritical branching 
process {Z n : n > 1}. When the weights are all one the result appeared in [5] using a martingale 
CLT for the proof. Our proof is completely different. It uses Application 1 in an important way and 
allows the ratios to consist of weighted sums. We begin with some notation. 

In Theorem 3 we assume {bj : j > 1} is a sequence of non-negative numbers and set 

X n = (— - - m)\[~L\ x , 

where 

N n = biZ n + b 2 Z n -i H h b n Zi, 

D n = hZn-i + b 2 Z n - 2 H h b n Z , 

and we understand X n to be zero if D„ = 0. Then we have the following CLT. 

Theorem 3. Let to > 1, < a 2 = E((Z\ — to) 2 ) < oo, and assume {bj : j > 1} is a sequence of 
non-negative numbers with 



< K = V" — -V . 



For k>l, let 



% = (2-29) 



and set 

rr 2 00 b 2 

K TO^ 
3=1 

Then, A 2 < oo, and /or a/Z reaZ x we ftawe 

lim P(X„ < a;|5) = P(G < ar), (2.30) 

n — >oo 

where G is a mean zero Gaussian random variable with E(G 2 ) = A 2 . In particular, if bj = or 
bj = 1 for all j > 1 and some 6j > 0, iften A 2 = cr 2 and (2.30) holds with G a mean zero Gaussian 
random variable with E(G 2 ) = a 2 . 

Remark 4. If we condition on {Z n ^\ > 0} instead of S in Theorem 3, the limit is the same. 

3 Proof of Theorem 1 

The proof of Theorem 1 is based on a lemma for weak convergence in infinite product spaces, and 
an iterative technique developed in Lemma 3 below. This iterative lemma also is applicable to the 
proof of Proposition 1, which appears in Section 6. 

Let (M,d) be a complete separable metric space, fj, a Borel probability measure on (M,d), and 
7r : M — > M Borel measurable. Define, 
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for all Borel sets A of (M, d). Let M°° denote the infinite product space with the product topology 
and typical point s = (si, s^, • ■ • )• If z is a fixed point in M, we define the mapping tti : M°° — > M°°, 
for J > 1, by 

7T;(s) = (S1,S 2 ,-" - ,). 

We now indicate a lemma concerning weak convergence in product spaces. Its proof is easily antic- 
ipated. 

Lemma 1. Let M be as above and assume {/z n : n > 1} and /Kqo are Borel probability measures 
on M°° with the product topology. Then {fj, n : n > 1} converges weakly to if and only if 
converges weakly to fi^ for all I > 1 . 

The next lemma is used in the proof of Theorem 1 . When m = 1 the result follows from Theorem 
2, p. 20, of PP, and when m > 1, a bit of calculation shows that it follows from Lemmas 1 and 2, p. 
4-5, of pQ. 

Lemma 2. Lei {Z n : n > 0} be a Galton- Watson process with Zq = 1. If m> 1 > p\. then for each 

J e [l,oo) 

A p(z„>o) =0 - (3 - 1} 

Proof of (2.9) Let /i denote the probability measure induced by H on M, and /Zoo be the 
infinite product measure formed by \x on M°°. Also let p, n denote the law of X„ r („) when Z n _\ is 
conditioned to be stricty positive, i.e. for A a Borel subset of M°° we have 

Hn(A) = P(X n>r{n) e A\Z n ^i > 0). 

By Lemma 1 it is sufficient to establish, for each I > 1, the weak convergence of to /i^. If we 
identify the range space of 717 with M l in the obvious way, then it suffices to show that on M l we 
have that 

A„ = C(X ni z„ ._i,-Xn-i,z„_ 2 ! ' ' ' > Xn—i+i,z n -i\Z n —i > 0) 

converges weakly to (fJ,) 1 , the 1-fold product of /i on that space. 

To establish weak convergence of A„ to it is sufficient by Theorem 2.2 of [2] to show for 
arbitrary continuity sets Ei of the measure \x on M that 

/ 

lim A„ (Si x E 2 x • • ■ x Ei) = TT n{E s ). (3.2) 



j'=i 



We will now verify 



Lemma 3. Let {Z„ : n > 0} 6e a Galton-Watson process with Zq = 1, m > 1 > px, and 
holding. Also let J-q = {0, f2} and T n = o~({£k,j :j>l}:l<fc< «) for n > 1. Let I > 1 be 
an integer and let {/3 n ,i,7n,i : 1 < i < / < fi < 00} 6e random variables such that p n ^ is T n -i 
measurable, 7„^ is j^Vi-i+i measurable for all 1 < i < I < n, and for some constant c > we have 
\0n,i\ < c a.s. and \"f n ,i\ < c a.s. for all 1 < i < I < n. Suppose for every i = 1, • • • , Z i/iai 

— /z„_ l>0 £(7rM " /3n,i|^n-i) -» ill L X (P), (3.3) 
Qn—i 
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where q n = P(Z n > 0) for all n > 1, and set 

i i 

r n ,i = I{Z n - z >0} Y\_ T". 1 " ^".» = ^{Z„_,>0} ]^[ Ai.i/) (3-4) 

for every 1 < i < I < n. Then for every j = 1, • • • , Z we Ziawe 

1 ^(rnj-BnjI^n-O-OinL^P) (3.5) 



lim £(TT 7njl - TT 7nii |Z n _j > 0) = 0. (3.6) 



n^oo 



//( maps the non-negative integers into [0, oo) with liim^oo ((k) = 0, then we also have 

lim E(C(Z n )\Z n > 0) = 0. (3.7) 

n— >oo 

Proof. First we observe that (3.6) follows from (3.5) by taking expectations. The proof of (3.5) 
goes by backwards induction on j. By (3.3) with i — I, we see that (3.5) holds with if j = I, so now 
let 1 < j < I be a given integer such that (3.5) holds for j + 1. 

Set Fk = {Zk > 0}. Then the F^s decrease as k increases, and 



Hence, if 



we then have 



where 



and 



TnJ - B n j — iFn-jlnjFnj+i - I Fn _ j P n jB n j + i, 
%,j = iFn-jInj and n>i = iFn.j.j/Jnj, 

E({T nJ - B ntj )\^ n -i) = I n + II n + III n , (3.8) 

In = ~ $ Tlt j)T n> j + i\J : n -i), (3.9) 

H n = E(f3 n j(T n j + i - B n ^ + i)\T n -i), (3.10) 

IIIn = E(Pn,j(Bn,j+l - ^F n _ 3 - B n ,j+l) I Fn-l ) • (3-11) 



Therefore, it suffices to show that the three quantities ^^-,^^7, and |^ converge to zero in 
L\P). 
Let 

Then, since |7 n ,»| < c, r n .j + i is -Fn-j measurable, and 1 < j < £, we have 

£?(|/„|) = E{\E{E[\ ntj \^ n _ j ]T ntj+1 \T n _ l )\) < msK{lJ}E{\E[(%j - f3 nj )\T n ^}\) (3.12) 
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Since \(3 n ,j\ < c i we have |/F„_j/3n.j — Pn,j\ < clF njir] F^_,, and hence (2.7) implies that 

lim q-^EQI^.^j - p nJ \) = 0. (3.13) 
Hence by combining (3.3) with i = j, and (3.13), we see that 

hm q-i j E(\A nJ \) = 0. (3.14) 

Thus converges to zero in L X (P). 
Next we observe that 

Pn,j^n,j+l = Pn,j^n,j+1 an d /3 n jB n j + i = /3 n jB n j + i. (3.15) 

Thus /3„j being T n -i measurable implies 



q„ 

and 



TI " =0 n jE((r ntj+1 -B nJ+1 )\r n - l )/q n - j (3.16) 



(7n— j 

Therefore, 

E(\Hn\) 



(3n,jE((B n j + i — I Fn _ ] B n j + i)\T n -i) / q n - r (3-17) 



< cE(\E(T nJ+1 - B„ J+1 )|jT„_ i )|)/ gn _ j , (3.18) 



and since the induction hypothesis provides (3.5) for j + 1 and (2.7) holds, we have that (3.18) 
implies 

li m ^™=0. (3.19) 
Using (3.17), a similar argument implies that 



provided we show 



hm«M=0 (3.20) 

ra->oo q n _j 



hm E (\ B ^ + i-lF^B n , +1 \) = Q (3 21) 



Now iFn-jlFn-j-! = -^f„_j, and hence 

i 

|S nJ+ i - /F n _ 3 S n , j+1 | = K/^.,^ - 1] ^."1 ^ ^{^c'K^-.-in^.- (3.22) 

i/=j+i 

and hence (2.7) implies (3.21). 

Thus the induction holds, and to complete the proof of Lemma 3 it remains to verify (3.7). Hence 
let e > be given, and choose k e > 1 and c > such that ((h) < e for all k > k e and ((k) < c for 
all k > 1. Then we have 

E(I {Zn>0} ((Z n )) < eP(Z n > k t ) + cP{\ < Z n < k e ), (3.23) 
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and (3.7) follows from (3.1) of Lemma 2. I 



Proof of (3.2). Set 9 (B) = I B (z) and 9 k {B) = P{H k {£ u ■ ■ ■ , £ fc ) G B) for B a Borel subset of 
M and k > 1, and let A„ f j = {X„_i + i i 2„_ i € J5<} for 1 < i < / < n. Then, 



where d(k) = |0fc(£ , i ) — for k > and 1 < i < Z. Since the sets Ei are assumed to be 

continuity sets for the measure //, (2.6) implies 



and hence (3.7) implies (3.3) of Lemma 3 with y n j = Ia„ 4 , & n ,% = M(-^i)) an d 1 < * < ^- Therefore, 
the assumptions of Lemma 3 hold, and hence (3.2) follows by taking j = 1 in (3.6). Hence Theorem 
1 holds. ■ 

4 Proof of Theorem 2 

Application 1 of Theorem 1 implies (2.28) on (Co[0, 1])°° with the product topology. Now we turn 
to its proof for the spaces co,a(Co[0, 1]) and their stated norms q\. Given that weak convergence 
in the product topology implies the finite dimensional distributions of any finite set of coordinates 
converges in correct fashion, it suffices to show the probability measures of (2.27) are tight on the 
spaces Co,a(Co[0, 1]). This is the content of our next lemma. Its proof establishes Theorem 2. 

Lemma 4. Let {fj, n : n > 1} be as in (2.27), assume m > 1, and that r(n) — > oo, with r(n) = o(n). 

If (2.25) holds and Xj = (5j log(j + 3))~ 5 ( where limj^oo 5j = oo, then the {fi n : n > 1} are tight on 

(i+g) 

co,a(Co[0, 1]). Similarly, if (2.26) holds and Xj = j t> for S > 0, then we also have : n > 1} 
tight on c 0:X (C [Q, 1]). 

Proof. Since the finite dimensional distributions of any finite set of coordinates of {/x ra } converge 
weakly to the corresponding ones for £(Bi,B 2 , ■ ■ ■ ) , standard arguments allow us to finish the proof 
by showing the are tight on co,a(Co[0, 1])). 

To establish tightness we apply the remark in [8j, p. 49. To show this remark applies we use the 
fact that the distributions of any finite set of coordinates are tight (since they are convergent), and 
therefore it suffices to show for each e > that there exists a d(e) such that d > d(e) implies 



Here Qd(f) = (0, • • • ,0, fd+i, fd+2, '••) for f e (C o [0, 1])°°. Since we are assuming r{n) tends to 
infinity, for all n sufficiently large we have 



P^KAFn-i) = 0z n _i(Ei) and |E(I A: 



^(EOI^n-01 = Ci(Z n _i), 



lim Q(k) = 0, 



limSUp P(qx(Qd(X-n,r(n))) > € \ Z n-l > 0) < 6. 



(4.1) 



r(n) 



P(qx(Q d (X nA n))) > t\Zn-l > 0) < 7 ' 



]=d+l 



where 



P( max 

l<KZ n 



—j 
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Setting J n j — I n jP(Z n -\ > 0) we see 

oc I 



J «J = !>2 P ( max I y^(£«-j+i,fc - m )\ ^ r " eA 7 \ Z n-j = r, Z n -i > 0). (4.2) 

* — ' KKr c — ' J 

r=l fc=l 

Thus 

oo I 

Jn.j <J2P( max | J2(!; n -j + i,k ~m)\> rhXj^Z^ = r)P(Z n _j = r), (4.3) 

' — ' KKr * — 4 J 

r=l fc=l 

and by the branching property we see 

oo / 

J n ,j < E 1 - m )l ^ ^eAT 1 )P(Z„_ J = r), (4.4) 



where : fc > 1} are i.i.d. with law that of the offspring distribution. Since A^ ■ 1 — * oo there exists 
a jo = Jo(e) such that j > jo and Ottavianni's inequality implies 



- \-l 



J„j < 2^P(|E(e fe -m)| > -^-)P(Z n _ j =r). (4.5) 

r=l fe=l 

Now under (2.25), Lemma 4.1 of [H] implies for all r > l,j > 1 that 

P(\ - TO )I ^ -y^) ^ 2cxp{-^ 2 A7 2 /(64/3)} = 2(j + 3)"^, 
and hence for j > jo we have 

< 4(j + 3)-^P(Z„_, > 0)/P(Z„_! > 0). (4.6) 

When in = 1 and < Var(Z\) = a 2 < oo, we have by Theorem 1, p. 19, of [T] that linin^oo nP(Z n > 
0) = 2/er 2 . Hence for n — j > uq we have 

P(Z»„ J - > 0) _ (n - OT^j > 0) (n - 1) < (1 - §) 2 

P(Z n _ 1 >0) (n-l)P(Z n _!>0)(n-i) " (i-i) " (l-i)' K ' } 

Thus for j = o(n), j > jo, we have 

I n ,j < 16(j + 3)-^. (4.8) 

Now take j! = ji(e) such that j > ji implies 6ij3 j > 2. Given e > 0, r(n) — o(n) and d > d Q (e) = 
max (j , ji, ^ + 1), we have 

r(n) 

limsupP(gA(Qd(X n!r(n ))) > e|Z„_i > 0) < limsup V] I nJ < e. 

j=d+\ 

Hence the lemma is proven under (2.25) if m = 1. If m > 1, then (4.8) is an even easier consequence 
of (4.6) since hin^^oo P(Z n > 0) = 1 — q > 0. Hence if r(n) = o(n) , the lemma also holds in this 
case. 
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If (2.26) holds, then for all r > and p > 2 we have a constant B p < oo such that an application 
of Markov's inequality and Corollary 8.2 in [3], p. 151, implies 

p ( iB & -™)i>^>< B .fflf^*). 

k =i (-J-)p 

_i (l+a) 

Hence the arguments can be completed as before, since under (2.26) we have A^ — j p ■ Thus 
the lemma is proven. I 



5 Proof of Theorem 3 

Before we turn to the proof of Theorem 3 we provide a brief lemma, and recall that if to > 1 and 
< (J 2 = E{{Z\ — m) 2 ) < oo , then the Kesten-Stigum theorem, [TJ, p. 9 (also see p. 24), implies 
that with probability one that 



lim W n = W, (5.1) 

n — >oo 

where W n = J^-, and W > almost surely on the survival set S. 
Lemma 5. Under the given assumptions, we have almost surely that 

li m = y X W . (5.2) 
n—>oo to™ z — ' mi 



TO J 

3>1 



Furthermore, for k > 1 almost surely on S we have 



lim **fp* = 6 k , (5.3) 

where 8 k is given as in (2.29). 
Proof. Observe that 

=/ = > — tW + > — r( r - Wj, 

TO™ z — ' TO™ L — ' 771 z — ' 771 TO™ K 

fc=l fc=l fc=l 

and since \im n ^ ^ Z n /m n = W almost everywhere by (5.1), an elementary argument combining 
k < oo and to > 1 easily implies lim^oo Ylt=x jffr( m n-t — W) = almost everywhere. Thus (5.2) 
holds. Combining (5.2) and (5.1) with W > almost surely on S, we thus have (5.3). Hence the 
lemma is proven. I 

For the proof of Theorem 3 recall that if D n — 0, then we understand X n to be zero. Furthermore, 
if D n > 0, we then have 



3=1 

and for 1 < d < n we define 



Zn-j+1 
Z n -j 



to), 



x n , d = £ Jh^i^b-z— { ^i±i _ m) . 

3=1 J 
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Of course, when D n = 0, we understand X n and X n ^ as given in these formulas to be zero. We also 
use 

X n = \ — ~X n and X n a = \ — — X n a, 
V m™ V rn n 

and their formulas analogous to those above for X n and X n ^. 

Proof of Theorem 3. Take e > 0, and to simplify the notation set j n — (^)'- Then 

P(X n <x\S) = P(X n < x ln \S), (5.4) 

P(X n < x ln \S) < P{X n4 <(x + e) ln \S) + P(\X n - X n4 \ > e ln \S), (5.5) 

and 

P(\X n - X n4 \ > e ln \S) < P(\X n - X n4 \ > eS\S) + P(0 < 7n < 5\S). (5.6) 

Since e > is given, we choose 6 > sufficiently small that P(0 < W < 25 2 )/P{S) < e. Since 
linv^oo 7„ = Wi > almost surely on 5*, there exists no = no(5) such that n> no implies 

P(0 < 7 „ < 8\S) < e. (5.7) 

Once e, 5 > are fixed, we choose do = do(e, S) such that d > do implies that uniformly in n 

P(\X n - X n4 \ > e6\S) < e. (5.8) 

To obtain do we observe that P(\X n — X n ^\ > c5\S) < P(\X n — X n ^\ > e5)/P(S), and since k < oo 
and the branching property easily implies 

n n 

E((X n -X n , d ) 2 ) = Yl rn-^EiZ^^a 2 = £ <j\m^ +1 \ 

j=d+l j=d+l 

we have do = do(e, 5) such that d > do implies 

E((X n -X n4 ) 2 ) <e 3 S 2 . (5.9) 

Hence Markov's inequality, (5.9), and the above reasoning allows us to choose do independent of n, 
so (5.8) holds. 

Since P(X n ^d < x + e\S) — P(X n ^ < (x + e) 7n |S), by combining (5.4)-(5.8) we have for d > do 
that 

P(X n < x\S) < P{X n4 <x + e\S) + 2e. (5.10) 
Similarly, we also have for d > do that 

P(X n < x\S) > P(X ntd <x- e\S) - 2c. (5.11) 

Now let 

^^EV^VP^^-m). (5.12) 

3 = 1 J 
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and observe that by setting t = 1 in the functional CLT of Application 1 , the continuous mapping 
theorem immediately implies the uniform stochastic boundedness of 

{y/bJZ~(?Z±!± - m) : 1 < j < d, n > 1} 

when these variables are conditioned on the event {Z n _\ > 0}. Therefore, for each fixed d we have 
from (5.3) of Lemma 5 and the previously mentioned uniform stochastic boundedness that 

lim P(\X n , d -X' nd \ >e|S) = 0. (5.13) 

n — >oo 

In addition, by the CLT provided by Application 1 and the continuous mapping theorem we easily 
have 

lim P(x' n d < x\Z n _ x > 0) = P(G d < x) (5.14) 

n^oo 

for all real x, where G d is a mean zero Gaussian random variable with variance A d = Y^j=i OjbjV 2 - 
Since the events {Z n -i > 0} I S with P(S) = 1 — q > 0, by combining a standard argument 
implies (5.10), (5.13), and (5.14) we have for all d > do and all real x that 

limsupP(A„ < x\S) < P{G d < x + 2e) + 3e. (5.15) 

n — *oo 

Using (5.11), a similar argument implies for all d> do and all real x that 

liminf P(X n < x\S) > P(G d < x - 2e) - 3e. (5.16) 
Now take di = d\ (e) sufficiently large such that d> d\ implies for all real x that 

P(G d <x)-e< P(G <x)< P{G d < x) + e, (5.17) 

where G is as in the proposition. This condition follows easily since h? d — > A 2 < oo. 
Letting d tend to infinity in (5.15) and (5.16), (5,17) implies for all x that 

limsupP(A„ < x\S) < P(G < x + 2e) + 3e, (5.18) 

n^oc 

and 

liminf P(X n < x\S) > P(G < x - 2e) - 3e (5.19) 

n^oc 

Letting e J. in (5.18) and (5.19), we have (2.30). Hence the the theorem is proven as the last claim 
is immediate from (2.30) . I 



6 Examples 

In this section we provide some examples where the CLT fails. We focus on the CLT as it is perhaps 
the result one might expect would be most likely to persist under suitable modifications of our basic 
assumptions. In the first example failure results from our branching process {Z n : n > 0} being 
subcritical. Hence, even though one has the same conditional independence structure as in the 
critical and supercritical cases, its behavior is quite different. In the other example the CLT fails 
through the use of deterministic normalizers. 
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Subcritical Branching Fails the CLT: Our result concerns the limit of 

and shows that even for this single distribution the CLT always fails. This is easy to see since the 
distribution of all the H k 's of the following lemma are discrete. 

Lemma 6. Assume that E(Z 2 ) < oo and set L n — z Zn - — m). Then, for any x € IR, 

lim P(L n < x\Z n -x > 0) = V P(VkH k < x)9 k , (6.1) 

•where Hf. — rSj=ite — m )> {£2 : * > 1} ar ^ i-i.d. with £(£i) = C(Zi), and {9 k : k > 1} is a 
probability distribution. 

Proof of Lemma 6. Let igE. Then, by the branching property we easily have 

P(L n < x\Z n -i > 0) = P ( Ln ^ x ' Zn - 1 = k \Z n -i > 0) (6.2) 

fc>0 

= J2 P (^H k < x)P(Z n ^ = h\Z n - X > 0). (6.3) 

k>l 

Since m < 1, Yaglom's Theorem on p. 18 of [T] implies that linin^oo P(Z„_ 1 = k\Z n _i > 0) = 9 k . 
where {9 k : k > 1} is a probability distribution. Thus, by the generalized dominated convergence 
theorem, it follows that 

lim P(L n < x\Z n > 0) = V P(VkH k < x)6 k . (6.4) 

k>l 

This completes the proof of the lemma. I 

Deterministic Normalizers Prevent the CLT: Even when m > 1, the next example shows 

the spatial finite dimensional distributions related to Application 1 fail to be Gaussian when we 

„_i i 

use canonical deterministic normalizations m 2 instead of Z^_ x in our CLT results. Of course, 
the motivation for these normalizations results from the Kesten-Stigum result, see (5.1), and in this 
situation the limit laws are a mixture of Gaussian laws and the random variable W that appears in 
that result. 



Proposition 1. Let m > 1, < a 2 = -E((£i,i — m) 2 ) < 00. For i = 1, • • • ,1 < n, set H n _i + i = 

md zero otherwise, and let 

B n>i = {H n - i+1 < U}, (6.5) 



("-') x 

— ( g '+ 1 _ TO ) when Z n ^j > 0, and zero otherwise, and let 



where ti, ■ ■ ■ , U £ (—00, 00) . Then 

lim P(B nA n • • • n B n ,i n {z n ^ > 0}) - E^W 1 ' 2 ) ■ ■ ■ ^(w 1 / 2 )^), (6.6) 

n — >oo 

where Sq = {linin^oo z Zn - — m} and W is as in (5.1). 
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Proof of Proposition 1. Let {£j : j > 1} be i.i.d. copies of Z\, and set 

k 

h(0,t) = / [0 ,oo) and h(k,t) = P(^(6 - m) < <rk*t) 

i=l 

for fc > 1 and t G i? 1 . Then P(_B„^|^ r „_ i ) = h(Z n - i7 W n ~iti) and by the central limit theorem we 
have 

C(fc) ee sup \h(k,s) - $(s)| -» 0, 
where $(•) is the standard Gaussian cumulative distribution function. Therefore, we have 

\E(I Bn . - i{W n -iti)\^ n -i)\ < C(Zn-i) + mw n -iU) - HW n -iU)\, (6.7) 

and since m > 1, we have P(Z„ > 0) J, 1 — g = P(So) > and (2.7) holding. Moreover, since 
< a 1 < oo we have by (5.1) that W n W a.s. and Z n — > oo on So- Hence the right hand side 
of (6.7) tends to zero on So as n tends to infinity, and the argument yielding (3.7), and {Z n>0 } J. S 
with P(SASa) = implies for all i = 1, • • • , I that 

E(\E(I Bn . - ^(W n ^U)\T n ^)\ \Z n ^ > 0) -» 0. (6.8) 

Hence we see that {"in.ii fin, i\ = {lB n ,n'&(Wn-i'ti)} satisfies (3.3), as well as the remaining assump- 
tions of Lemma 3. Therefore, Proposition 1 follows from (3.6) of Lemma 3. II 
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